Претрага ⚒ Радови ⚒ Др РГФ - Репозиторијум РГФ

Претрага

Per page

Sort by

68 items

Development of Open Educational Resources (OER) for Natural Language Processing

Cvetana Krstev, Biljana Lazić, Ranka Stanković, Giovanni Schiuma, Miladin Kotorčević (2015)

In this paper we present the development of an online course at the edX BAEKTEL platform named “Lexical Recognition in the Natural Language Processing (NLP)”. It is based on the course of the same name for PhD studies at the University of Belgrade, Faculty of Philology. There are not many courses in Computational Linguistics (CL) on OER platforms, and there is none in Serbian either for CL or NLP. We have developed this course in order to improve this ...

E-Learning, Open Educational Resources, Computational Linguistics, Lexical Resources, edX

... 2. COMPUTATIONAL LINGUISTICS AND NATURAL LANGUAGE PROCESSING Computational linguistics (CL) is a theoretical discipline between linguistics and computer science concerned with understanding and modelling the written and spoken language from a computational aspect.[3]Natural ...
... There are not many courses in Computational Linguistics (CL) on OER platforms, and there is none in Serbian either for CL or NLP. We have developed this course in order to improve this situation as it can prove useful both for linguists working in corpus linguistics and computer scientists developing ...
... knowledge to use the existing resources for NLP for Serbian and to develop new ones. Keywords: E-Learning, Open Educational Resources, Computational Linguistics, Lexical Resources, edX 1. INTRODUCTION Open educational resources (OER) publicly available on the web are growing quickly and are ...
Cvetana Krstev, Biljana Lazić, Ranka Stanković, Giovanni Schiuma, Miladin Kotorčević. "Development of Open Educational Resources (OER) for Natural Language Processing" in The Sixth International Conference on e-Learning (eLearning-2015), September 2015, Belgrade, Serbia, Belgrade : Belgrade Metropolitan Univesity (2015)
Веб-алат за управљање грађом Речника САНУ и анотација листића

Рада Стијовић, Ранка Станковић, Михаило Шкорић (2020)

Грађа на основу које се израђује Речник српскохрватског књижевног и народног језика САНУ, а која садржи материјал из преко 4.500 писаних извора и 300 рукописних збирки речи са подручја народних говора штокавског наречја, забележена је на око 5.000.000 листића. Богат лексички материјал, који обухвата књижевни и народни језик у протекла два века и на основу кога треба да се напише још најмање 15 томова Речника, пружа могућност и за разноврсна лингвистичка и ванлингвистичка истраживања. Из тог разлога се приступило ...

лексикографска грађа, листићи, лексикографски алат, дигитализација, анотација

Рада Стијовић, Ранка Станковић, Михаило Шкорић. "Веб-алат за управљање грађом Речника САНУ и анотација листића" in Rasprave Instituta za hrvatski jezik i jezikoslovlje, Institute of Croatian Language and Linguistics (2020). https://doi.org/10.31724/rihjj.46.2.32
EUROLAN 2021: Introduction to Linked Data for Linguistics Online Training School

Milan Dojchinovski, Julia Bosque Gil, Jorge Gracia, Ranka Stanković (2021)

Prva škola za obuku polaznika koju je organizovala COST akcija NexusLinguarum održana je od 8. do 12. februara 2021. godine sa ciljem da studenti, istraživači i stručnjaci nauče osnove lingvističke nauke o podacima. Tokom obuke polaznici su se upoznali sa širokim spektrom tema: od semantičkog veba, RDF -a i ontologija, do modeliranja i pretraživanja jezičkih podataka pomoću najsavremenijih ontoloških modela i alata. Škola je održana u okviru serije letnjih škola EUROLAN-a i organizovalo ju je virtuelno (onlajn) nekoliko instituta; ...

nauka o lingvističkim podacima, povezani podaci u lingvistici, jezički podaci, EUROLAN, NexusLinguarum, COST akcija, škola za obuku

... and covers topics that are particularly relevant to the fields of computational linguistics and natural language processing (NLP). The goal of this 15th EUROLAN School was to bring together scholars, teachers and students of linguistics, NLP and information technology to discuss the principles and best ...
... Introduction to Linked Data for Linguistics Online Training School Milan Dojchinovski, Julia Bosque Gil, Jorge Gracia, Ranka Stanković Дигитални репозиторијум Рударско-геолошког факултета Универзитета у Београду [ДР РГФ] EUROLAN 2021: Introduction to Linked Data for Linguistics Online Training School | ...
... employees' publications. - The Repository is available at: www.dr.rgf.bg.ac.rs Professional paper EUROLAN 2021: Introduction to Linked Data for Linguistics Online Training School UDC 81: 37.018.53 DOI 10.18485/infotheca.2021.21.1.7 ABSTRACT: The first training school or- ganized by the NexusLinguarum ...
Milan Dojchinovski, Julia Bosque Gil, Jorge Gracia, Ranka Stanković. "EUROLAN 2021: Introduction to Linked Data for Linguistics Online Training School" in Infotheca, Faculty of Philology, University of Belgrade (2021). https://doi.org/10.18485/infotheca.2021.21.1.7
Frequency and Length of Syllables in Serbian

Marija Radojičić, Biljana Lazić, Sebastijan Kaplar, Ranka Stanković, Ivan Obradović, Ján Mačutek, Lívia Leššová (2019)

Basic analyses of several properties of syllables (the rank-frequency distribution, the distribution of length, and the relation between length and frequency) in Serbian is presented. The syllabification algorithm used combines the maximum onset principle and the sonority hierarchy. Results indicate that syllables behave similarly to words as far as mathematical models are concerned, but values of parameters in models for syllables are quite different from those for words.

frekvencije slogova, dužina slogova, srpski jezik

... Issues in Quantitative Linguistics 2: 136-147. Lüdenscheid: RAM-Verlag. Frequency and Length of Syllables in Serbian 123 Mačutek, J., Wimmer, G. (2013). Evaluating goodness-of-fit of discrete distribution models in quantitative linguistics. Journal of Quantitative Linguistics 20, 227-240. Meštrović ...
... Press. Crystal, D. (2008). A Dictionary of Linguistics and Phonetics. Oxford: Blackwell. Eddington, D., Treiman, R., Elzinga, D. (2013a). Syllabification of American English: Evidence from a large-scale experiment. Part I. Journal of Quantitative Linguistics 20, 45-67. Eddington, D., Treiman, R. ...
... bivariate model. Journal of Quantitative Linguistics 20, 241-251. Koščová, M., Mačutek, J., Kelih, E. (2016). A data-based classification of Slavic languages: Indices of qualitative variation applied to grapheme frequencies. Journal of Quantitative Linguistics 23, 177-190. Köhler, R. (2005). Properties ...
Marija Radojičić, Biljana Lazić, Sebastijan Kaplar, Ranka Stanković, Ivan Obradović, Ján Mačutek, Lívia Leššová. "Frequency and Length of Syllables in Serbian" in Glottometrics (2019)
A Method for Extracting Translational Equivalents from Aligned Texts

Obradović Ivan (2013)

Obradović Ivan. "A Method for Extracting Translational Equivalents from Aligned Texts" in Methods and Applications of Quantitative Linguistics, Selected papers of the 8th International Conference on Quantitative Linguistics (QUALICO) in Belgrade, Serbia, April 26-29, 2012, Ivan Obradović, Emmerich Kelih, Reinhard Köhler (eds.), :University of Belgrade & Academic Mind (2013): 119-129
Quantitative analysis of syllable properties in Croatian, Serbian, Russian, and Ukrainian

Biljana Rujević, Marija Kaplar, Sebastijan Kaplar, Ranka Stanković, Ivan Obradović, Jan Mačutek (2021)

slogovi, distribucija rang-preciznost, slovenski jezici

Biljana Rujević, Marija Kaplar, Sebastijan Kaplar, Ranka Stanković, Ivan Obradović, Jan Mačutek. "Quantitative analysis of syllable properties in Croatian, Serbian, Russian, and Ukrainian" in Language and Text: Data, models, information and applications, John Benjamins Publishing Company (2021). https://doi.org/10.1075/cilt.356.04ruj
E-Dictionaries and Finite-State Automata for the Recognition of Named Entities

Krstev Cvetana, Vitas Duško, Obradović Ivan, Utvić Miloš (2011)

Krstev Cvetana, Vitas Duško, Obradović Ivan, Utvić Miloš. "E-Dictionaries and Finite-State Automata for the Recognition of Named Entities" in Proceedings of the 9th International Workshop on Finite State Methods and Natural Language Processing, FSMNLP 2011, July 2010, Blois, France, A. Maletti and M. Constant (eds.), :Association for Computational Linguistics (2011): 48-56
Corpus-based bilingual terminology extraction in the power engineering domain

Tanja Ivanović, Ranka Stanković, Branislava Šandrih Todorović, Cvetana Krstev (2022)

Ovaj rad predstavlja resurse i alate koji se koriste za ekstrkciju i evaluaciju dvojezične, englesko-srpske terminologije u domenu energetike. Resursi se sastoje od postojeće opšte i domenske leksike i domenskog paralelnog korpusa; alati uključuju ekstraktore termina za oba jezika i alat za poravnavanje segmenata koji pripadaju korpusnim rečenicama. Sistem je testiran variranjem funkcije podudaranja koja utvrđuje prisustvo ekstrahovanog termina u poravnatom segmentu (odsečak), u rasponu od veoma labavog do strogog. Procena rezultata je pokazala da je preciznost izdvajanja termina ...

Library and Information Sciences, Communication, Language and Linguistics

Tanja Ivanović, Ranka Stanković, Branislava Šandrih Todorović, Cvetana Krstev. "Corpus-based bilingual terminology extraction in the power engineering domain" in Terminology, John Benjamins Publishing Company (2022). https://doi.org/10.1075/term.20038.iva
Extraction of Bilingual Terminology Using Graphs, Dictionaries and GIZA++

Branislava Šandrih, Ranka Stanković (2020)

U nauci, industriji i mnogim istraživačkim oblastima, terminologija se brzo razvija. Najčešće, jezik koji je „lingua franca“ za većinu ovih oblasti je engleski. Kao posledica toga, za mnoga polja termini domena su koncipirani na engleskom, a kasnije se prevode na druge jezike. U ovom radu predstavljamo pristup za automatsko izdvajanje dvojezične terminologije za englesko-srpski jezički par koji se oslanja na usaglašeni dvojezični korpus domena, ekstraktor terminologije za ciljni jezik i alat za usklađivanje delova. Ispitujemo performanse metode na domenu ...

ekstrakcija terminologije, validacija terminologije, GIZA++, grafovi, Unitex, klasifikacija teksta

... Processing and Computational Linguistics (CICLING2016), LNCS. Springer, 2016 Hazem, Amir and Emmanuel Morin. “Efficient Data Selection for Bilin- gual Terminology Extraction from Comparable Corpora”. In Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical ...
... Expressions from Small Parallel Corpora”. In Proceedings of the 23rd International Conference on Computational Linguistics: Posters, COLING ’10, 1256– 1264. Stroudsburg, PA, USA: Association for Computational Linguistics, 2010. http://dl.acm.org/citation.cfm?id=1944566.1944710 Vintar, Špela and Darja Fǐser ...
... . In Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology, Vol. 1, 48–54. Association for Computa- tional Linguistics, 2003. Infotheca Vol. 19, No. 2, December 2019 135 Šandrih B., Stanković R., “Extraction ...
Branislava Šandrih, Ranka Stanković. "Extraction of Bilingual Terminology Using Graphs, Dictionaries and GIZA++" in Infotheca, Faculty of Philology, University of Belgrade (2020). https://doi.org/10.18485/infotheca.2019.19.2.6
Digital Library From A Domain Of Criminalistics As A Foundation For A Forensic Text Analysis

Dalibor Vorkapić, Aleksandra Tomašević, Miljana Mladenović, Ranka Stanković, Nikola Vulović (2017)

U ovom radu predstavljen je model koji omogućava prikupljanje, pripremu, opis metapodataka, upravljanje i eksploataciju, uključujući pretragu punog teksta dokumenata iz domena kriminalistike napisanih na srpskom jeziku. Predloženi pristup primenjuje se na veb portalu koji sakuplja različite tekstove nastale iz časopisa Akademije za kriminalistiku i policijske studije, Krivičnog zakona Srbije, konferencija „Tara“ i „Reiss“, kao i iz nekih doktorskih disertacija vezanih za ovu oblast istraživanje. Nakon obrade teksta, korpus koji sadrži preko 5500 stranica običnog teksta, kreiran je i ...

Omeka, Wordnet, pretraga punog teksta, morfološka i semantička pretraga teksta, proširenje upita

... on which administrator regulates the Digital Library 3 https://www.gnu.org/licenses/gpl-3.0.en.html http://master-kpa.rgf.rs/ FORENSIC LINGUISTICS The linguistic study of forensic texts is a part of the field of Natural Language Processing, which includes text types classification and ...
... element of revenge in their last moments. Death row statements are within the heavily institutionalized setting of death row prisons. The Forensic Linguistics Institute holds a corpus of these documents and is conducting research on them. And the last but not the least important is social media. Social ...
... enhances the potentials of manipulating each particular resource as well as several resources simultaneously10. 4 John Olsson (2008).. Forensic Linguistics, Second Edition. London: Continuum ISBN 978-0-8264-6109-4 5 Falzini, Mark W. (9 September 2008). "The Ransom Notes: An Analysis of Their Content ...
Dalibor Vorkapić, Aleksandra Tomašević, Miljana Mladenović, Ranka Stanković, Nikola Vulović. "Digital Library From A Domain Of Criminalistics As A Foundation For A Forensic Text Analysis" in International Scientific Conference “Archibald Reiss Days” Thematic Conference Proceedings Of International Significance, Belgrade, 7-9 November 2017, Academy Of Criminalistic And Police Studies Belgrade (2017)
A Multilingual Evaluation Dataset for Monolingual Word Sense Alignment

Sina Ahmadi, John P McCrae, Sanni Nimb, Fahad Khan, Monica Monachini, Bolette S Pedersen, Thierry Declerck, Tanja Wissik, Andrea Bellandi, Irene Pisani, [...] Ranka Stanković and others (2020)

Aligning senses across resources and languages is a challenging task with beneficial applications in the field of natural language processing and electronic lexicography. In this paper, we describe our efforts in manually aligning monolingual dictionaries. The alignment is carried out at sense-level for various resources in 15 languages. Moreover, senses are annotated with possible semantic relationships such as broadness, narrowness, relatedness, and equivalence. In comparison to previous datasets for this task, this dataset covers a wide range of languages ...

lexical semantic resources, sense alignment, lexicography, language resource

... In Pro- ceedings of the 21st International Conference on Compu- tational Linguistics and the 44th annual meeting of the Association for Computational Linguistics, pages 105– 112. Association for Computational Linguistics. Niemann, E. and Gurevych, I. (2011). The people’s web meets linguistic knowledge: ...
... disambiguation rivaling supervised systems. In Proceedings of the 48th annual meeting of the asso- ciation for computational linguistics, pages 1522–1531. Association for Computational Linguistics. Pustejovsky, J. (1995). The Generative Lexicon. MIT Press, Cambridge, MA. RAE, R. A. E. (2001). Diccionario ...
... Association for Computational Linguistics, 1:151–164. Matuschek, M. and Gurevych, I. (2014). High perfor- mance word sense alignment by joint modeling of sense distance and gloss similarity. In Proceedings of COL- ING 2014, the 25th International Conference on Compu- tational Linguistics: Technical ...
Sina Ahmadi, John P McCrae, Sanni Nimb, Fahad Khan, Monica Monachini, Bolette S Pedersen, Thierry Declerck, Tanja Wissik, Andrea Bellandi, Irene Pisani, [...] Ranka Stanković and others . "A Multilingual Evaluation Dataset for Monolingual Word Sense Alignment" in Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020), Marseille, European Language Resources Association (ELRA) (2020)
Rule-based Automatic Multi-word Term Extraction and Lemmatization

Ranka Stanković, Cvetana Krstev, Ivan Obradović, Biljana Lazić, Aleksandra Trtovac (2016)

In this paper we present a rule-based method for multi-word term extraction that relies on extensive lexical resources in the form of electronic dictionaries and finite-state transducers for modelling various syntactic structures of multi-word terms. The same technology is used for lemmatization of extracted multi-word terms, which is unavoidable for highly inflected languages in order to pass extracted data to evaluators and subsequently to terminological e-dictionaries and databases. The approach is illustrated on a corpus of Serbian texts from ...

term extraction, terminology, multi-word units, lemmatization, finite-state transducers

... language in logic and the 4th conference on Computational natural language learning -Volume 7, Stroudsburg: Association for Computational Linguistics, pp. 67--72. Smadja, F.(1993). Retrieving Collocations from Text: Xtract, Computational Linguistics, 19(1), pp. 143-177. Stanković, R., Obradović ...
... Springer Berlin Heidelberg, pp. 1160--1165. Church, K. W., Hanks, P., (1990). Word association norms, mutual information, and lexicography, Computational Linguistics, 16, pp. 22--29. Church, K. W. Gale, W., Hanks, P., Hindle, D. (1991). Using statistics in lexical analysis, In U. Zernik (Ed.), Lexical ...
... Engine: ten years on. Lexicography 1(1), pp.7--36. Dunning, T. (1993). Accurate methods for the statistics of surprise and coincidence. Computational Linguistics, 19(1), pp.61--74. Frantzi, K., Ananiadou, S., and Mima, H. (2000). Automatic recognition of multi-word terms:.the C-value/NC-value ...
Ranka Stanković, Cvetana Krstev, Ivan Obradović, Biljana Lazić, Aleksandra Trtovac. "Rule-based Automatic Multi-word Term Extraction and Lemmatization" in Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016, Portorož, Slovenia, 23--28 May 2016, European Language Resources Association (2016)
Two approaches to compilation of bilingual multi-word terminology lists from lexical resources

Branislava Šandrih, Cvetana Krstev, Ranka Stanković (2020)

In this paper, we present two approaches and the implemented system for bilingual terminology extraction that rely on an aligned bilingual domain corpus, a terminology extractor for a target language, and a tool for chunk alignment. The two approaches differ in the way terminology for the source language is obtained: the first relies on an existing domain terminology lexicon, while the second one uses a term extraction tool. For both approaches, four experiments were performed with two parameters being ...

Linguistics and Language,Software,Artificial Intelligence,Language and Linguistics

Branislava Šandrih, Cvetana Krstev, Ranka Stanković. "Two approaches to compilation of bilingual multi-word terminology lists from lexical resources" in Natural Language Engineering, Cambridge University Press (CUP) (2020). https://doi.org/10.1017/S1351324919000615
Ontološki model upravljanja rizikom u rudarstvu

Olivera Kitanović (2021)

Rudarska proizvodnja obuhvata kompleksne tehnološke sisteme, što nameće potrebu za uspostavljanjem i unapređivanjem sistema upravljanja rizikom. Heterogenost i obim podataka neophodnih za upravljanje rizikom zahtevaju sistem koji ih na fleksibilan način integriše i omogućava njihovo optimalno korišćenje. Osnovni cilj ove disertacije je razvoj ontologije za domen rudarstva i na njoj zasnovanog modela za upravljanje rizikom. Njegova realizacija podrazumeva i implementaciju algoritama ekstrakcije informacija za popunjavanje ontologije, kao i odgovarajuće softversko rešenje. Razvoj modela obuhvata i značajno proširenje rudarskog korpusa, kao ...

rudarstvo, rizik, upravljanje rizikom, procena rizika, ontologija, semantička mreža, ekstrakcija informacija, upravljanje znanjem, računarska lingvistika

... model includes a significant expansion of the mining corpus, as well as the creation of a terminological database, realized using methods of computational linguistics and a corpus of documents from the mining domain (plans, reports, laws, textbooks and monographs). For systematization of data a descriptive ...
... Framework for Mining Enterprise Risk and Risk Factors from News Documents.” In Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: System Demonstrations, 180–84. Osaka, Japan: The COLING 2016 Organizing Committee. https://www.aclweb.org/anthology/C16-2038. Dechow, Patricia ...
... Marti A. 1992. “Automatic Acquisition of Hyponyms from Large Text Corpora.” In Coling 1992 Volume 2: The 15th International Conference on Computational Linguistics. Henley, Ernest J, and Hiromitsu Kumamoto. 1996. “Probabilistic Risk Assessment and Management for Engineers and Scientists.” IEEE Press (2nd ...
Olivera Kitanović. Ontološki model upravljanja rizikom u rudarstvu, Beograd : [O. Kitanović], 2021
From ELTeC Text Collection Metadata and Named Entities to Linked-data (and Back)

Milica Ikonić Nešić, Ranka Stanković, Christof Schöch and Mihailo Škorić (2022)

In this paper we present the wikification of the ELTeC (European Literary Text Collection), developed within the COST Action ``Distant Reading for European Literary History'' (CA16204). ELTeC is a multilingual corpus of novels written in the time period 1840—1920, built to apply distant reading methods and tools to explore the European literary history. We present the pipeline that led to the production of the linked dataset, the novels’ metadata retrieval and named entity recognition, transformation, mapping and Wikidata population, ...

Wikidata, linked data, SPARQL, distant reading, literary corpus, named entity linking, ELTeC

Milica Ikonić Nešić, Ranka Stanković, Christof Schöch and Mihailo Škorić. "From ELTeC Text Collection Metadata and Named Entities to Linked-data (and Back)" in Proceedings of The 8th Workshop on Linked Data in Linguistics within the 13th Language Resources and Evaluation Conference, June 2022, Marseille, France, European Language Resources Association (2022)
Using English Baits to Catch Serbian Multi-Word Terminology

Cvetana Krstev, Branislava Šandrih, Ranka Stanković (2018)

In this paper we present the first results in bilingual terminology extraction. The hypothesis of our approach is that if for a source language domain terminology exists as well as a domain aligned corpus for a source and a target language, then it is possible to extract the terminology for a target language. Our approach relies on several resources and tools: aligned domain texts, domain terminology for a source language, a terminology extractor for a target language, and a ...

aligned texts, word alignment, terminology extraction, electronic dictionaries, morphological inﬂection

... n, pages 187–197. Association for Computational Linguistics. Koehn, P., Och, F. J., and Marcu, D. (2003). Statistical Phrase-based Translation. In Proceedings of the 2003 Conference of the North American Chapter of the As- sociation for Computational Linguistics on Human Lan- guage Technology-Volume ...
... Statistical Alignment Models. Computational linguistics, 29(1):19–51. Papineni, K., Roukos, S., Ward, T., and Zhu, W.-J. (2002). BLEU: a Method for Automatic Evaluation of Machine Translation. In Proceedings of the 40th annual meeting on association for computational linguistics, pages 311– 318. Association ...
... parallel corpora. In Pro- ceedings of the 23rd International Conference on Com- putational Linguistics: Posters, COLING ’10, pages 1256–1264, Stroudsburg, PA, USA. Association for Computational Linguistics. Vintar, Š. and Fišer, D. (2008). Harvesting multi-word ex- pressions from parallel corpora ...
Cvetana Krstev, Branislava Šandrih, Ranka Stanković. "Using English Baits to Catch Serbian Multi-Word Terminology" in Proceedings of the 11th International Conference on Language Resources and Evaluation, LREC 2018, Miyazaki, Japan, May 7-12, 2018, European Language Resources Association (ELRA) (2018)
Resource-based WordNet Augmentation and Enrichment

Ranka Stanković, Miljana Mladenović, Ivan Obradović, Marko Vitas, Cvetana Krstev (2018)

In this paper we present an approach to support production of synsets for SerbianWordNet(SerWN)byadjustingPrincetonWordNet(PWN)synsetsusing several bilingual English-Serbian resources. PWN synset deﬁnitions were automatically translated and post-edited, if needed, while candidate literals for Serbian synsets were obtained automatically from a list of translational equivalents compiled form bilingual resources. Preliminary results obtained from a setof1248selectedPWNsynsetsshowthattheproducedSerbiansynsetscontain 4024 literals, out of which 2278 were offered by the system we present in this paper, whereas experts added the remaining 1746. Approximately one half of ...

WordNet, bilingual resources, term alignment, parallel lists

... phrasets. In Proceedings of the Tenth Conference on European Chapter of the Association for Computational Linguistics - Volume 2, EACL ’03, pages 67–70, Stroudsburg, PA, USA. Association for Computational Linguistics. Bentivogli, L., Forner, P., Magnini, B., and Pianta, E. (2004). Revising the wordnet domains ...
... Enrichment | Ranka Stanković, Miljana Mladenović, Ivan Obradović, Marko Vitas, Cvetana Krstev | Proceedings of the Third International Conference Computational Linguistics in Bulgaria (CLIB 2018), May 27-29, 2018, Sofia, Bulgaria | 2018 | | http://dr.rgf.bg.ac.rs/s/repo/item/0002014 Дигитални репозиторијум ...
... balancing. In Proceedings of the Workshop on Multilingual Linguistic Ressources, MLR ’04, pages 101–108, Stroudsburg, PA, USA. Association for Computational Linguistics. Bhingardive, S., Ajotikar, T., Kulkarni, I., Kulkarni, M., and Bhattacharyya, P. (2014). Beyond lexical units: Enriching wordnets with phrasets ...
Ranka Stanković, Miljana Mladenović, Ivan Obradović, Marko Vitas, Cvetana Krstev. "Resource-based WordNet Augmentation and Enrichment" in Proceedings of the Third International Conference Computational Linguistics in Bulgaria (CLIB 2018), May 27-29, 2018, Sofia, Bulgaria, Sofia : The Institute for Bulgarian Language Prof. Lyubomir Andreychin, Bulgarian Academy of Sciences (2018)
FrameNet Lexical Database: Presenting a Few Frames Within the Risk Domain

Aleksandra Marković, Ranka Stanković, Natalija Tomić, Olivera Kitanović (2021)

U radu se daje kratak prikaz teorije semantike okvira, na kojoj je zasnovana leksička baza Frejmnet. Predstavljena je koncepcija ove mreže, kao i mogućnosti njene primene. Predstavljena je i leksička analiza koja se primenjuje u projektu izrade Frejmneta i ukazano na razlike između analize zasnovane na okviru u odnosu na analizu zasnovanu na reči. Zatim je prikazano nekoliko povezanih okvira koje prizivaju reči iz domena rizika. U radu je predstavljena i platforma NLTК pomoću koje se mogu koristiti ...

Srpski jezik, semantika okvira, FrameNet, scenario rizika, rudarski korpus, obrada prirodnog jezika

... semantic roles.” Computational linguistics 28 (3): 245–288. Infotheca Vol. 21, No. 1, September 2021 31 Marković A. et al., FrameNet Lexical Database. . . , pp. 7–33 Hamilton, Craig, Svenja Adolphs, and Brigitte Nerlich. 2007. “The meanings of ‘risk’: A view from corpus linguistics.” Discourse & Society ...
... s of the Corpus Linguistics Conference, 14:17. Pradhan, Sameer, Wayne Ward, Kadri Hacioglu, James H Martin, and Dan Jurafsky. 2005. “Semantic role labeling using different syntactic views.” In Proceedings of the 43rd Annual Meeting of the Association for Com- putational Linguistics (ACL’05), 581–588 ...
... Atkins, Beryl T. S. 1994. “Analyzing the verbs of seeing: a frame semantics approach to corpus lexicography.” In Annual Meeting of the Berkeley Linguistics Society, 20:42–56. 1. 30 Infotheca Vol. 21, No. 1, September 2021 Scientific paper Atkins, Sue, Charles J Fillmore, and Christopher R Johnson ...
Aleksandra Marković, Ranka Stanković, Natalija Tomić, Olivera Kitanović. "FrameNet Lexical Database: Presenting a Few Frames Within the Risk Domain" in Infotheca, Faculty of Philology, University of Belgrade (2021). https://doi.org/10.18485/infotheca.2021.21.1.1
Multi-word Expressions for Abusive Speech Detection in Serbian

Ranka Stanković, Jelena Mitrović, Danka Jokić, Cvetana Krstev (2020)

Ovaj rad predstavlja istraživanja na usavršavanju i unapređenju srpske verzije rečnika Hurtlex, višejezičnog leksikona uvredljivih reči. Posebnu pažnju posvećujemo dodavanju izraza sa više reči (polileksemskih jedinica) koji se mogu smatrati uvredljivim, jer su takvi leksički zapisi veoma važni za postizanje dobrih rezultata u mnoštvu zadataka otkrivanja uvredljivog jezika. Srpski morfološki rečnici se koriste kao osnova za čišćenje podataka i stvaranje rečnika. Istaknuta je veza sa drugim leksičkim i semantičkim resursima na srpskom jeziku i predviđena je izgradnja sistema za ...

uvredljiv govor, govor mržnje, leksički izvori, višejezični leksikon, izrazi sa više reči

... and Duško Vitas. 2020. Analysis of similes in serbian literary texts (1840- 1920) using computational methods. In Svetla Koeva, editor, Proceedings of the Fourth International Confer- ence Computational Linguistics in Bulgaria (CLIB 2020). Institute for Bulgarian Language “Prof. Lyubomir Andreychin”, ...
... Elisa Bassignana, Valerio Basile, and Viviana Patti. 2018. Hurtlex: A multilingual lexicon of words to hurt. In 5th Italian Conference on Computational Linguistics, CLiC-it 2018, volume 2253, pages 1–6. CEUR-WS. Tommaso Caselli, Valerio Basile, Jelena Mitrović, Inga Kartoziya, and Michael Granitzer ...
... detection: A hybrid approach with deep learning and a multilingual lexicon. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, pages 363–370. Endang Wahyu Pamungkas, Alessandra Teresa Cignarella, Valerio Basile, and V. Patti. 2018. Automatic ...
Ranka Stanković, Jelena Mitrović, Danka Jokić, Cvetana Krstev. "Multi-word Expressions for Abusive Speech Detection in Serbian" in Proceedings of the Joint Workshop on Multiword Expressions and Electronic Lexicons, Association for Computational Linguistics (2020)
Bilingual lexical extraction based on word alignment for improving corpus search

Jelena Andonovski, Branislava Šandrih, Olivera Kitanović (2019)

Library and Information Sciences,Computer Science Applications

Jelena Andonovski, Branislava Šandrih, Olivera Kitanović. "Bilingual lexical extraction based on word alignment for improving corpus search" in The Electronic Library, Emerald (2019). https://doi.org/10.1108/EL-03-2019-0056

Претрага

68 items

Development of Open Educational Resources (OER) for Natural Language Processing cite

Веб-алат за управљање грађом Речника САНУ и анотација листића cite

EUROLAN 2021: Introduction to Linked Data for Linguistics Online Training School cite

Frequency and Length of Syllables in Serbian cite

A Method for Extracting Translational Equivalents from Aligned Texts cite

Quantitative analysis of syllable properties in Croatian, Serbian, Russian, and Ukrainian cite

E-Dictionaries and Finite-State Automata for the Recognition of Named Entities cite

Corpus-based bilingual terminology extraction in the power engineering domain cite

Extraction of Bilingual Terminology Using Graphs, Dictionaries and GIZA++ cite

Digital Library From A Domain Of Criminalistics As A Foundation For A Forensic Text Analysis cite

A Multilingual Evaluation Dataset for Monolingual Word Sense Alignment cite

Rule-based Automatic Multi-word Term Extraction and Lemmatization cite

Two approaches to compilation of bilingual multi-word terminology lists from lexical resources cite

Ontološki model upravljanja rizikom u rudarstvu cite

From ELTeC Text Collection Metadata and Named Entities to Linked-data (and Back) cite

Using English Baits to Catch Serbian Multi-Word Terminology cite

Resource-based WordNet Augmentation and Enrichment cite

FrameNet Lexical Database: Presenting a Few Frames Within the Risk Domain cite

Multi-word Expressions for Abusive Speech Detection in Serbian cite

Bilingual lexical extraction based on word alignment for improving corpus search cite

Development of Open Educational Resources (OER) for Natural Language Processing

Веб-алат за управљање грађом Речника САНУ и анотација листића

EUROLAN 2021: Introduction to Linked Data for Linguistics Online Training School

Frequency and Length of Syllables in Serbian

A Method for Extracting Translational Equivalents from Aligned Texts

Quantitative analysis of syllable properties in Croatian, Serbian, Russian, and Ukrainian

E-Dictionaries and Finite-State Automata for the Recognition of Named Entities

Corpus-based bilingual terminology extraction in the power engineering domain

Extraction of Bilingual Terminology Using Graphs, Dictionaries and GIZA++

Digital Library From A Domain Of Criminalistics As A Foundation For A Forensic Text Analysis

A Multilingual Evaluation Dataset for Monolingual Word Sense Alignment

Rule-based Automatic Multi-word Term Extraction and Lemmatization

Two approaches to compilation of bilingual multi-word terminology lists from lexical resources

Ontološki model upravljanja rizikom u rudarstvu

From ELTeC Text Collection Metadata and Named Entities to Linked-data (and Back)

Using English Baits to Catch Serbian Multi-Word Terminology

Resource-based WordNet Augmentation and Enrichment

FrameNet Lexical Database: Presenting a Few Frames Within the Risk Domain

Multi-word Expressions for Abusive Speech Detection in Serbian

Bilingual lexical extraction based on word alignment for improving corpus search